Word category and prosodic emphasis in dialog modules of speech technology applications
Abstract
The prosodic behavior of word categories in Modern Greek is examined in relation to two keyword-based Speech Technology applications: a Speech Recognition system for task-oriented dialogs in the healthcare domain, intended for senior citizens, and the prosodic modelling of utterances produced by a Conversational Agent in a dialog system for consumer complaints. In both applications, word categories are evaluated and classified with respect to the types of words each category contains, their relation to keywords, and their effect on the user.
Similar resources
Recognition of Out-of-vocabulary Words and Their Semantic Category
In almost all applications of automatic speech recognition, especially in spontaneous speech tasks, the recognizer vocabulary cannot cover all occurring words. There is always a significant amount of out-of-vocabulary (OOV) words even when the vocabulary size is very large. In this paper we present a new approach for the integration of OOV words into statistical language models. It is based on t...
Prosody Modeling for Automatic Speech Recognition and Understanding
This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automati...
Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing
We describe a “direct modeling” approach to using prosody in various speech technology tasks. The approach does not involve any hand-labeling or modeling of prosodic events such as pitch accents or boundary tones. Instead, prosodic features are extracted directly from the speech signal and from the output of an automatic speech recognizer. Machine learning techniques then determine a prosodic m...
Determining High Level Dialog Structure without Requiring the Words
The potentially enormous audio resources now available both to organizations and on the Internet present a serious challenge to audio browsing technology. In this paper we outline a set of techniques that can be used to determine high-level dialog structure without the requirement of resource-intensive, accent-dependent automatic speech recognition (ASR) technology. Using syllable finding al...
Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech?
Identifying whether an utterance is a statement, question, greeting, and so forth is integral to effective automatic understanding of natural dialog. Little is known, however, about how such dialog acts (DAs) can be automatically classified in truly natural conversation. This study asks whether current approaches, which use mainly word information, could be improved by adding prosodic informati...
Publication date: 2008